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Abstract 

In this paper we consider the transmission of classical information 
through a class of quantum channels with long-term memory, which 
are given by convex combinations of product channels. Hence, the 
memory of such channels is given by a Markov chain which is aperiodic 
but not irreducible. We prove the coding theorem and weak converse 
for this class of channels. The main techniques that we employ, are 
a quantum version of Feinstein's Fundamental Lemma EH an d a 
generalization of Helstrom's Theorem. [Jj . 



1 Introduction 



The biggest hurdle in the path of efficient information transmission is the 
presence of noise, in both classical and quantum channels. This noise causes 
a distortion of the information sent through the channel. Error-correcting 
codes are used to overcome this problem. Instead of transmitting the original 
messages, they are encoded into codewords, which are then sent through the 
channel. Information transmission is said to be reliable if the probability of 
error, in decoding the output of the channel, vanishes asymptotically in the 
number of uses of the channel (see e.g. [Sj and JB]). The aim is to achieve 
reliable transmission, whilst optimizing the rate, i.e., the ratio between the 
size of the message and its correponding codeword. The optimal rate of 
reliable transmission is referred to as the capacity of the the channel. 

Shannon, in his Noisy Channel Coding Theorem ^Bj , obtained an explicit 
expression for the channel capacity of discrete, memoryless 1 , classical chan- 
nels. The first rigorous proof of this fundamental theorem was provided by 
Feinstein jS]. He used a packing argument (see e.g. [TQJ ) to find a lower bound 
to the maximal number of codewords that can be sent through the channel 
reliably, i.e., with an arbitrarily low probability of error. More precisely, he 
proved that for any given 5 > 0, and sufficiently large number, n, of uses of a 
memoryless classical channel, the lower bound to the maximal number, N n , 
of codewords that can be transmitted through the channel reliably, is given 
by 

Here H(X : Y) is the mutual information of the random variables X and Y, 
corresponding to the input and the output of the channel, respectively. We 
refer to this result as Feinstein's Fundamental Lemma, following Khinchin 
|10j . It implies that for a real number R < C, where C = max H(X : Y), 
(the maximum being taken over all possible input distributions), M n < 2 nR 
classical messages can be transmitted through the channel reliably. In other 
words, any rate R < C is achievable. 

For real world communication channels, the assumption that noise is un- 
corrected between successive uses of a channel cannot be justified. Hence 
memory effects need to be taken into account. This leads us to the consider- 
ation of quantum channels with memory. The first model of such a channel 
was studied by Macchiavello and Palma [12 . They showed that the trans- 
mission of classical information through two successive uses of a quantum 
depolarising channel, with Markovian correlated noise, is enhanced by using 

^or such a channel, the noise affecting successive input states, is assumed to be per- 
fectly uncorrelated. 
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inputs entangled over the two uses. An important class of quantum channels 
with memory consists of the so-called forgetful channels. The channel stud- 
ied in ^2] falls in this class. Roughly speaking, a forgetful channel is one for 
which the output after a large number of successive uses, does not depend on 
the initial input state. Forgetful channels have been studied by Bowen and 
Mancini [2 and more recently by Kretschmann and Werner [11] . In ^T], cod- 
ing theorems for arbitrary forgetful channels were proved. The proof of the 
direct channel coding theorem for a class of quantum channels with Marko- 
vian correlated noise, where the underlying Markov Chain was aperiodic and 
irreducible, was sketched out in 0J. Very recently Bjelakovic and Boche pQ 
have proved a coding theorem for causal ergodic classical-quantum channels 
with decaying input memory. 

The capacities of channels with long-term memory (i.e., channels which 
are "not forgetful"), had remained an open problem to date. In this paper we 
evaluate the classical capacity of a class of quantum channels with long-term 
memory. The tool that we develop to prove the relevant coding theorem, can 
be considered to be a quantum analogue of Feinstein's Fundamental Lemma 
[IJ. For a quantum memoryless channel, our method yields an alternative 
proof of the Hole vo- Schumacher- Westmoreland (HSW) Theorem Ej, sim- 
ilar in spirit to the proof in ,19J. 

We start the main body of our paper with some preliminaries in Section 
121 Our main result is stated in Section El For clarity of exposition, we 
follow this with a proof of the quantum analogue of Feinstein's Fundamental 
Lemma for memoryless channels in Section The proof of our main result, 
for a class of quantum channels with long-term memory, is given in Section 

El 

In summary, in this paper we consider the transmission of classical infor- 
mation through a class of quantum channels with long-term memory, which 
are convex combinations of product channels (defined through (jSJ) of Section 
EJ). The memory of the channel is given by a Markov chain which is aperi- 
odic but not irreducible. We prove the coding theorem and weak converse 
for this class of channels. The main techniques that we employ are a quan- 
tum version of Feinstein's Fundamental Lemma jSl El and a generalization 
of Helstrom's Theorem [7]. Our results can be extended to quantum chan- 
nels with arbitrary Markovian correlated noise. The proofs in this case are 
technically more involved and will be presented in a subsequent paper. 
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2 Preliminaries 

Let B(7i) denote the algebra of linear operators acting on a finite-dimensional 
Hilbert space TC. The von Neumann entropy of a state p, i.e., a positive op- 
erator of unit trace in B(TC), is defined as S(p) = — Trplogp, where the 
logarithm is taken to base 2. A quantum channel is given by a completely 
positive trace-preserving (CPT) map $ : B(H) — > £>(/C), where Ti and K, 
are the input and output Hilbert spaces of the channel. Let dim 7i = d and 
dim /C = d! . For any ensemble {pj,Pj} of states pj and probability distribu- 
tions {pj}, the Holevo x quantity is defined as 



where the maximum is taken over all ensembles {pj,pj} of possible input 
states pj G B(7i) occurring with probabilities pj. It is known that the maxi- 
mum in (j2J) can be achieved by using an ensemble of pure states, and that it 
suffices to restrict the maximum to ensembles of at most d 2 pure states. 

Let us consider the transmission of classical information through succes- 
sive uses of a quantum channel Let N uses of the channel be denoted by 
& n \ Suppose Alice has a set of messages, labelled by the elements of the set 
Ai n = {1,2,..., M n }, which she would like to communicate to Bob, using 
the quantum channel $. To do this, she encodes each message into a quan- 
tum state of a physical system with Hilbert space TC® n , which she then sends 
to Bob through n uses of the quantum channel. In order to infer the message 
that Alice communicated to him, Bob makes a measurement (described by 
POVM elements) on the state that he receives. The encoding and decoding 
operations, employed to achieve reliable transmission of information through 
the channel, together define a quantum error correcting code (QECC). More 
precisely, a code of size N n is given by a sequence {p\ n \ E^}^ where 
each pf 1 ^ is a state in B{7i® n ) and each is a positive operator acting 
in /C® n , such that Ylf=i < I n . Here I n denotes the identity operator in 
£(/C® n ). Defining E° n = J n -E& E\ n \ yields a resolution of identity in /C 0n . 
Hence, {E[ }f= a defines a POVM. An output i > 1 would lead to the infer- 
ence that the state (or codeword) p\ n ^ was transmitted through the channel 
$( n ) ; whereas the output is interpreted as a failure of any inference. The 




(1) 



The Holevo capacity of a quantum channel $ is given by 



:= max x(fe,*(p,)}) 



(2) 
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average probability of error for the code is given by 

Pe(C (n) ) ■= (l " Tr ($ W (p! n) )4 n) )) , (3) 

If there exists an N G N such that for all n > N, there exists a sequence of 
codes {CW}™!, of sizes N n > 2 nR , for which P e (C (n) ) -> as n -> oo, then 
i? is said to be achievable rate. 

The capacity of $ is defined as 

£($):= sup i?, (4) 

where R is an achievable rate. If the codewords p\ , i = 1,2, .. . ,N n , are 
restricted to product states in £>(7i® n ), the capacity C($) is referred to as 
the product state capacity. 



3 Main Result 

In this paper we study a class of channels with long-term memory. For a 
channel $ in this class, ^ : B(H® n ) -> £(/C® n ) and the action of $W on 
any state p^ G B(H® n ) is given as follows: 

M 

$W( p H) = ^ 7i $f«(pW), ( 5 ) 
i=i 

where $ f : -> £(£), (i = 1,...,M) are CPT maps and ^ > 0, 

J2i-Li Ji — 1- Notice that this is an example of a quantum channel with 
memory given by a Markov chain, which is aperiodic but not irreducible ^3] . 
Our main result is given by the following theorem. 

Theorem 3.1 The product state capacity of a channel $, with long-term 
memory, defined through (EJ), is given by 



C(<&) = sup 



M 



A i=1 Xi({pj,Pj}) 



where Xi{{Vji Pj}) '■= X ({Pji *&i{Pj)})- The supremum is taken over all finite 
ensembles of states pj G B(TC) with probabilities pj. 

Here we use the standard notation A to denote the minimum. 

The product state capacity can be generalized to give the classical ca- 
pacity of the channel $ in the usual manner, that is, by considering inputs 
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which are product states over uses of blocks of n channels, but which may be 
entangled across different uses within the same block. The classical capacity 
^classical (^) ^ s obtained in the limit n — > oo and is given by 

Qa SS ical( < f)=ilm ( 6 ) 

4 Analogue of Feinstein's Fundamental Lemma 
for a Memoryless Quantum Channel 

In this section we prove an analogue of Feinstein's Fundamental Lemma [Sj 
for a memoryless quantum channel This is given by Theorem 14.11 below. 
It provides an upper bound to the maximal number of codewords that can 
be sent reliably through $. 

The proof of our main result, Theorem 13.11 employs a theorem which is 
a generalization of Theorem 14.11 

Theorem 4.1 Let $ : B(7i) — > B(K.) be a memoryless quantum channel. 
Given e > 0, there exists hq 6 N such that for all n > uq there exist at least 
N n > 2 n ( x *(*) _e ) product states . . . , p^l G B(7i® n ) and positive operators 

E[ n \ ...,E ( £e B{lC® n ) such that Ek=i E ¥ < J « and 



Tr 



for each k. 

Here x*{&) ^ s the Holevo capacity of the memoryless quantum channel 



Before giving the proof of Theorem 14.11 let us briefly sketch the idea 
behind it. The proof employs the idea of construction of a maximal code. 
For a given e > 0, starting with an empty code, the proof gives a prescription 
for successively adding codewords pj and corresponding POVM elements 
Ej, j = 1,2,..., such that 

:=1-Tr$( n )(pf } ) <e (8) 

Note that is the probability of error in inferring the j th codeword. This is 
done until no more codewords can be added without violating the condition 
(|5|). The resulting code is maximal. Let the size of this code be N n . The 
proof ensures that the number N n is large and provides a lower bound for it 
in terms of the Holevo capacity x*(&)- 
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Proof. Let the maximum in (J2J) be attained for an ensemble {Pj, Pj}j=i- 
Denote Oj = $(pj), o = Y,j=iPj&(Pj) an d o n = a® n . Since a n is a product 
state, its eigenvalues and eigenvectors can be labelled by sequences k = 

Choose 5 > 0. We will relate 5 to e at a later stage. There exists n% G N 
such that for n > n\, there is a typical subspace of /C® n , with projection 
P n such that if a n has a spectral decomposition 

(9) 
(10) 

(11) 
(12) 



then 



-log A, 



(n) 



5"(a) 



e 

< 3 



for all such that \ip^) G T e " and 



Tr{P n a n ) > 1-5 2 



Further define 



s = Y,Pj s ( a i 

3=1 



Lemma 4.1 Given a sequence j = (ji, ■ ■ ■ ,j n ) G J n , let P^ be the pro- 
jection onto the subspace of K,® n spanned by the eigenvectors of o 



(n) 



<jj n with eigenvalues X^l = n^=i ^um suc ^ ^ ria ^ 



n 



n 



logA^+5 



< 



For any 5 > there exists n 2 G N snc/i t/iat /or n > n 2 , 



E 



Tr (afpf 



>l-5 2 



(13) 
(14) 



Proof. Define i.i.d. random variables Xx, . . . , X n with distribution given 

by 

¥(Xi = Aj, fc ) = pj X jt fc , (15) 

where Aj^, = 1, 2, . . . , d! are the eigenvalues of Oy By the Weak Law of 
Large Numbers, 

n J d' 

- Yl lo S X i ~* E ( lo S X i) = HI! Pi X i,k log hk 

n i=l j=l k=l 

-T,pj 3 (°s) = - s - ( 16 ) 
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It follows that there exists ri2 such that for n > 112, the typical set Tj") of 
sequences of pairs k±), . . . , (j n , k n )) such that 



n ■ 



< 



satisfies 



Obviously, 



and 



((ii,fci),...,(iu,fcn))eT E ( " ) i ~ 1 



p( n ) > 
3 — 



E 



(17) 



(1* 



(19) 



k=(k-^,...,k n ) 



(n) 



E (Tr (afp^)) > P (7f )) > 1 - 5 2 . 



(20) 

□ 

Continuing the proof of the theorem, let N n be the maximal number N for 
which there exist product states p[ \ . . . , p$ on 7i® n and positive operators 
E{ n \...,EP on JC® n such that 

(i) E£i 4 n) < ^ and 

(ii) Tr[^4 n) ]>l- e and 

(iii) Tr[a„4 n) ] < 2-"[ 5 ^)-^-l< 

Herea£ n) = $®"(pt n) ). 

For any given j G J n define 

f =^-1^) PnPfPn^-E^J ■ (21) 



Clearly, V^ (n) < P„ - E*=i and we also have: 
Lemma 4.2 

Tr(a n K (n) ) < 2-"[ s ^-I e ]. 



^N n rp(n) 



(22) 
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Proof. Put Q n = J2k=i Ek- Note that Q n commutes with P n . Using the 
fact that P n a n P n < 2~ n ^-^ by (O, we have 



Tr(a n y; 



Tr 
Tr 



0~n(,Pn Qn) ^ PnPj Pi(Pi Qn) ^ 



Pn@nPn^yPn Qn) ^ Pj (Pi Qn 



.1/2 



< 2"^-^Tr \{P n - Q n )y 2 pf{P n - Q 



,1/2 



< 2-" [5(<f H e] Tr {P\ n) ) < 2 



-n[S(5-)-S-|e] 



(23) 



where, in the last inequality, we used the standard upper bound on the 
dimension of the typical subspace: Tr(P 3 - ) < 2 n ^ + ^ £ \ which follows from 
Lemma 14.11 □ 
Since N n is maximal, it now follows that 



< 1 - e. 



(24) 



and hence 
Corollary 4.1 



E Tr 



u 3 1 3 



< 1 



(25) 



Lemma 4.3 For all 1] > 0, there exists n 3 6 N such that for all n > n 3 , 



E 



(Tr 



A™) p p\ n ) p 

u j r n r j r n 



> 1 - 7]. 



(26) 



Proof. We write 
E^Tr ^"'p p"" 



^.W p pW p 

3 3 n 



E (Tr [afP^]) - E (Tr [af\l n - P n )P^ 



Tr 



(27) 



By Lemma 14. 1[ the first term is > 1 — 5 2 provided n > n 2 . The last two 
terms can be bounded using the Cauchy-Schwarz inequality as follows: 



E 



(Tr [af\l n -P n )Pf 
= E(Tr [(<£>) V2 (/ n - P n) pf (af) V2 

< |e (Tr [(^-P^af^-P^ 

^ ; I IV 



1 1/2 



1/2 



<{ E (Tr [of (I„-P„)])} 

1/2 



{E(^[aW(/ B -P B )])}^{E(Tl[ ff fpW])} 

1/2 



1/2 



Tr 



On(-^n Pn 



< 5 



by (|TT|) provided n>n\. Analogously, 



E 



(Tr [afP^^-P^ 



< 5. 



Choosing = n\ V n 2 and «5 2 + 2<5 < 77 the result follows. 
Lemma 4.4 Assume r\ < |e. Then for n > 113, 



Tr 



iV 
fc=l 



E Tr 



N 

fc 

k=l 



> V 



Proof. Define 

Qn P« (Pi Qn) ^ • 

By the above corollary, 

l-e > ^{Tr(af(P n -Q' n )P[ n \P n -Q' n ))} 

= E{Tr (a^PnPfPn)} 

-E{Tr (of Q n Pf >P B ) + Tr (of P n Pf ><&) } 

+E{ T r (4 n) g;pfQ; t )}. 

Since the last term is positive, we have, by Lemma f4. 31 

E {Tr (of Q n Pf } P„) + Tr (of P n P[ n) Q' n ) } > e - V > 2 V 

On the other hand, using Cauchy-Schwarz for each term, we have 
E{Tr (<rf Q n Pf P n ) +Tr (,f P n pf Q' n )} 

< 2 {e [Tr (Q' n *fQ' n )] } V2 {e [Tr (of P n Pf ) P, 

< 2 {E [Tr (afg; 2 



J 1/2 



J 1/2 



Thus, 



E 



Tr (<rf Q'l 



> 
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To complete the proof, we now claim that 

Qn > (Q' n ) 2 . 



(36) 



Indeed, on the domain of P n , (jHE|) follows from the inequality 1 — (1 — x) 2 > x 2 
for < x < 1. □ 
To complete the proof of Theorem 14.11 we now have by assumption, 



Tr 



cr n E, 



(n) 



< 2 - n [S(ff)-S-| e ] 



(37) 



for all k = 1, ... , N n . On the other hand, choosing 77 < |e and 5 < |r], we 
have by Lemma f4.4[ 



Tr 



fc=l 



> V 



provided n > 713. It follows that 
for n > n.3 and n > — - logr?. 



(38) 

(39) 
□ 



5 A class of channels with long-term memory 

We now consider the class of quantum channels with long-term memory, 
mentioned in the Introduction: 



M 



(40) 



i=l 



where : B(H) -> B(/C), (i = 1,...,M) are CPT maps and j { > 0, 
For an ensemble of states where G B(H), define 

x{{Pj,Pj}) ■= A i=1 Xi({Pj'PA)i ( 41 ) 
where Xi({Pj,Pj}) = X ({Pj, ®i(Pj)})- 

5.1 Proof of the direct part of Theorem 13.11 

To prove the direct part of Theorem 13.11 i.e., the fact that a rate R < C(<&) 
is achievable, we employ the quantum analogue of Feinstein's Fundamental 
Lemma for the class of channels defined by (j4"Uj) . This analogue is given by 
the following theorem, which we prove in Section 15.1.11 
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Theorem 5.1 Given e > 0, there exists Uq G N such that for all n > n 
there exist at least N n > 2 n ^ c( ^- e ^ product states p^ , . . . , pff G B{H® n 
and positive operators , 
such that for each k — 1, . . 



Here 



C($) :- 



Tr 



sup 



"° G B(/C 

*(») ( p W) 



suc/i i/iai 2^ -E^ n ' ) < I n and 



N 



> 1 



A,^iXi({Pi>Pi}) = SU P x({Pj,Pi}), 



(42) 



(43) 



where the supremum is over all finite ensembles of states pj with probabilities 

The above theorem implies that a rate R < C($) is achievable. This can 
be seen as follows: Given an R < C($), choose e > such that R < C($) — e. 
Then, Theorem 15.11 guarantees the existence of codes of size 



N > 2< c ^)-^ > T 



with codewords given by product states Pj , and POVM elements EY lJ , for 

which the probability of error, e^, can be made arbitrarily small, for each 
j G {1,2,..., N n } and n large enough. Hence the rate R is acheivable. 



R 



5.1.1 Proof of Theorem 15.11 

Choose an ensemble {Pj,Pj}j=i such that 

p.}) + I e . (44) 

Define a Lj = ®i(pj), cr}j = <g)" =1 cr ijr , a { = J2j=iPj^i{Pj) = ®i(p), and 
= af n . Let ^ , i — 1, . . . , M, be the orthogonal projections onto the 
typical subspaces for the states so that, as above, 

Tr(P/ n) af n) ) > 1 - 5 2 (45) 

for n large enough, and 

P^a^P^ <2- n ^-H (46) 

By Lemma 14.11 there also exist typical subspaces with projections Pjj for 
which 

E(Tr (<t$P$)) > 1 - (47) 
11 



for n large enough. 

To distinguish between the different memoryless branches, $j, of the 
quantum channel $, we add a preamble to the input state encoding each 
message in the set M. n . This is given by an m-fold tensor product of a suit- 
able state (as described below). Let us first sketch the idea behind adding 
such a preamble. Helstrom [2j showed that two states o\ and o"2, occurring 
with a priori probabilities 71 and 72 respectively, can be distinguished, with 
an asymptotically vanishing probability of error, if a suitable collective mea- 
surement is performed on the m-fold tensor products af m and crf m , for a 
large enough m G N. The optimal measurement is projection-valued. The 
relevant projection operators, which we denote by Il + and IT~, are the orthog- 
onal projections onto the positive and negative eigenspaces of the difference 
operator A m = ■ji<jf n — 72crf n . Here we generalize this result to distinguish 
between the different branches If the preamble is given by a state u® m , 
then, by using Helstrom's result, we can construct a POVM which distin- 
guishes between the output states af n := corresponding to the 
different branches $j, i — 1,2, ... , M. The outcome of this POVM measure- 
ment would in turn serve to determine which branch of the channel is being 
used for transmission. 

Notice that we may assume that all branches <3>j are different. Indeed, 
otherwise we do not need to distinguish them and can introduce a compound 
probability for each set of identical branches. This assumption means that 
there exist states u>i j on 7i for each pair 1 < % < j ' < M such that ^(a^j) 7^ 
Qjiuiij). Introducing the fidelity of two states as in [T5| . 

F(a, a') = Tr Va 1 / 2 ^ a 1 / 2 , (48) 

we then have 

F($ i K i ),$ i K i ))</<l (49) 

for all pairs We now introduce, for any m G N and 1 < i < j < M, the 

difference operators 

4? = n(M"i,j)T m - i^M,)f m - (so) 

Let n*,- be the orthogonal projections onto the eigenspaces of A\™' corre- 
sponding to all non-negative, and all negative eigenvalues, respectively. 

Lemma 5.1 Suppose that for a given 5 > 0, 

|Tr[|4?|]-(7i + 7i)l<5. (51) 
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Then 

|iv[n^K,on-i|<^: (52) 

and 

livfn-.^K^n-il^^-- (53) 

Proof. Write A = Aff and II* = nf,. First note that 

Tr [n± A] = -Tr [A ± (n+ - IT) A] 
= \(Tr[A]±Tt[\A\]) 

= \^-1 3 )±\k[\A\] (54) 
so that we have by the assumption 

|Tr [n+A]- 7i |<^ (55) 

and 

|Tr [Yl-A} +1] \< l -5. (56) 

Now, writing a, = ($j(a;i ,j))® m and aj = j(uo it j))® m we have obviously, 
Tr [n - (7j] > 0, and on the other hand, 

7l Tr [n-^] = Tr [IT A] + 7j Tr [IT^] < - 7j + ^5 + 7j = ±5. (57) 

The first result thus follows from IT 4 " + Il~ = I m and Tr<jj = 1. Similarly, 

7j Tr [n+^-J = -Tr [n+ A] + 7i Tr [IT^] < - 7i + ^5 + 7l = ^. (58) 

□ 

To compare the outputs of all the different branches of the channel, we 
define projections IT on the tensor product space ®i<i<j<M ^ <X>m — K,® mL 
with L = (^fj as follows: 

!I m if ii^i and i 2 ^ i 
n-, if 12 = i (59) 
n+ i2 if i x = i. 
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Notice that it follows from the fact that n^-IL.- = 0, that the projections Ilj 
are also disjoint: 

UiUj = for i ^ j. (60) 



Introducing the notation 



(mL) 



U! 



il<i 2 



we now have 



Lemma 5.2 For all i — 1, ... , M, 

lim Tr [n i $f mi (u {mL) ) 

Proof. Notice that for all i < j, 



1. 



Using the inequalities [T3] 

Tr (Ax) + Tr (A 2 ) - 2F(A 1; A 2 ) < \\A 1 - A 2 \\i < Tr (Ai) + Tr (A 2 ) 
for any two positive operators Ai and A 2 , we find that 

|Tr(|4 ( :; ) |)-(7 l + 7 J )l<2/ m , 



since 



Tr = |iT^Ki) 0m -7i^-K,)^|ii 



Using Lemma f5. II we then have 



1 > Tr 



= II ^ PU^^l U^Pt^M) 

i\<% %i>% 



□ 



We now fix m so large that 



Tr 



1% $f mL (^ (mL) ) 



> i -5 



(61) 



(62) 



F^^^A'K^™) = v ^77F(^(o; iJ ),^K j )) m < f m - (63) 



(64) 

(65) 
(66) 



(67) 



(68) 
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for all i = 1, . . . , M. The product state u>( mL \ defined through (JoTj) is used 
as a preamble to the input state encoding each message, and serves to dis- 
tinguish between the different branches, i = 1,2,..., M, of the channel. 
If p£ G B{H® n ) is a product state encoding the k th classical message in the 
set A4 n , then the k th codeword is given by the product state 

u^®pf\ 



Continuing with the proof of Theorem l5.lt let N = N(n) be the maximal 
number of product states pi, . . . , pjy on 7^®" (each of which is a tensor 
product of states in the maximising ensemble {Pj, Pj}j=i) for which there 
exist positive operators E[ n \ . . . , E$ on fc® mL (g) K,® n such that 

(i) 4 n) = £*=i n, ® 4? and Eti < A (n) and 



(ii) £7,Tr [^^L ^L))] Tr [ $? n E (n 



(hi) ][>Tr [fl^f mL (u^)]Tr [{UP)f n Et} 



> 1 — e and 



< 2 -„[C(1>)-ie]_ 



i=i 



for p = Ej=iPjPj- For each z = 1, . . . , M and j = (ji, • • • , j n ) € J n , we 
define, as before 



/ N \ V 2 / 2V 

V fc=l / \ k=l 

Clearly ^f<^-Ef =1 ^. Put 

M 

V^^n,®^. (TO) 



1/2 



,(n) 



(69) 



This is a candidate for an additional measurement operator, pjv+i' f° r B°b 
with corresponding input state p^ +1 = p^f — Pji ® Pj 2 ■ ■ ■ ® Pin- Clearly, the 
condition (i), given above, is satisfied and we also have 



Lemma 5.3 

M 



5~>Tr [fT$f mL (w (mL) )]Tr [afV^] < T 



n[C(*)-4e] 



(71) 



1=1 
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Proof. By Lemma \A.2\ replacing |e by \e in the definition of the typical 
subspaces, we have, 



(72) 



for n large enough. Then 

M 



M M 



i=l 
M 



1=1 



< 2 -n[C(*)-|e] 



(73) 

where we used the obvious fact that Tr % $f mi (4 mi) )] < 1. □ 
By maximality of N it now follows that the condition (ii) above cannot 
hold, that is, 



M 

£ 7i Tr [fl^f mL (u^)} Tr [$f* (/><">) 



< 1 -e 



(74) 



i=l 



for every j , and this yields the following: 
Corollary 5.1 



M 

£ 7i Tr [fl, $f mi (w^))] E (Tr [$f* (p< n } ) 



< 1 - e. 



(75) 



We also need the following lemma: 
Lemma 5.4 For all > 5 2 + 35, 



M 

^7iTr [fli<5>f mL (oj {mL) ) 



i=l 



Tr 



(n) p(n) p(n) p(n) 



> 1 - V (76) 



ji/n zs /arge enough. 

Proof. Using Lemma f4. 31 and ()68)1 . we have 

M 

X>Tr [fl,$f mi (^( mL ))] E(Tr [a<$ P[ n) P$ P[ n) 
-i=i 

provided r\ > 5 2 + 25. Hence the result follows. 



>(l-5)(l- v ) (77) 
□ 
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Lemma 5.5 Assume rj' < |e and write 



N 



(») 



Then for n large enough, 

M 



fc=i 



E Tr 



> 7] . 



(78) 



(79) 



Proof. This is analogous to Lemma f4. 41 Define 



By the Corollary 15.11 



M 



E ( Tr 



i=i 



30) 



= ]T 7 ,Tr [Yi^f mL (u^)] E |Tr (^pf^P^)} 
i=i 

M 

-^ 7l Tr [fl^f mL (J mL ^)] E |Tr (a^Q^'P^P/^ 
i=l 

+Tr (^P^P^Q^)} 

M 

+ 5>T* [n,$r L (w^))] E{Tr (trg^S^'P^g^')} 

i=l 



(81) 



Since the last term is positive, we have, by Lemma f5 .41 

M 

]T Ti Tr [LT,$f mi (u^)] E{Tr (a^Q^'pffP^] 
i=i 

+Tr (rrWpWpWg^) }>e-rf> 2rf. (82) 

On the other hand, using the Cauchy-Schwarz inequality for each term, we 
have 

M 

X>Tr [n,$f mL (u^)] E{Tr (or^Q^'P^P^ 
i=l 
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+Tr (tr^pPpgQW)} 



< 



1/2 



< 



E 



Tr (a^{Qf^ 2 



< 



2^ 7i Tr [fli$f mL (u {mL) ) 

U=i 
( M 

1 i=i 

f M 

2 £ 7i Tr [n 4 $r L (c( mL ))] E[Tr (^(Q^' 



1/2 



E 



1/2 



(83) 



.4 = 1 



Thus, 



M 



i=l 



E 



^« ) (^ n) ') 2 )]>^ 2 - (8 4 ) 



To complete the proof, we remark as before that 

Qn > (Q' n ?- 

It now follows, as before, that for n large enough, N(n) > (t/) 2 2™^*)' 
We take the following states as codewords: 

p (rnL + n) =uj{mL)0/ -(n)_ 

For n sufficiently large we then have 

N n+mL = N{n) > (r/^PW-M > 2 (mL + n)lC(Z)-e] _ 

To complete the proof we need to show that the set E^ satisfies 
But this follows immediately from condition (ii): 

(mL+n)\ rp(n) 



55) 
□ 



(86) 

(87) 
1. 



Tr (pi mL+n) ) E, 

M 

]T 7l Tr [$f (mL+n) ® pl n) ) 4 n) 



i=i 

M 
At 



> 



fij- $f mL (w (mL) ) 

]T 7i Tr [ni$f mi (w (mi) ) 



Tr 



8=1 



Tr 



$f"(pi n) )4? 



> 1 - e. 



□ 
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5.2 Proof of the converse of Theorem 13.11 



In this section we prove that it is impossible for Alice to transmit classical 
messages reliably to Bob through the channel $ defined in (J40|) at a rate 
R > C($). This is the weak converse of Theorem 13.11 in the sense that the 
probability of error does not tend to zero asymptotically as the length of the 
code increases, for any code with rate R > C($). To prove the weak converse, 
suppose that Alice encodes messages labelled by a G M. n by product states 
Pof 3 = Pa,i ® ■ ■ ■ ® Pa,n in B(H® n ). Let the corresponding outputs for the i-th 
branch of the channel be denoted by cr^j, i.e. 



Further define 



a 



a. X 



Hp 



(n) 



(n) 



and 



a 



\M I ^ a ' j 



(89) 
(90) 

(91) 



Let Bob's POVM elements corresponding to the codewords p^f> be denoted by 
a = 1, . . . , \M. n \- We may assume that Alice's messages are produced 
uniformly at random from the set M. n . Then Bob's average probability of 
error is given by 



P, 



(n) 



\M n \ 



Tr W n \p^)E^ 



(92) 



aeMr, 



We also define the average error corresponding to the i branch of the channel 

as 



_(n) 
PiJ 



\M n \ 



so that 



E-(n) 
7iPi,e- 



(93) 



(94) 



Let A^ n ^ be a random variable with a uniform distribution over the set 
Ai n , characterizing the classical message sent by Alice to Bob. Let Y^ 1 
be the random variable corresponding to Bob's inference of Alice's message, 
when the codeword is transmitted through the i th branch of the channel. It 
is defined by the conditional probabilities 



[Y} n) =/3\xW = a] 



Tr [$r(pL n) )^] 



(95) 
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By Fano's inequality, 

MpS?) +P% MlMnl - 1) > H(X^ I if >) = H(XM) - : Yf } ). 

(96) 

Here /i(-) denotes the binary entropy and #(•) denotes the Shannon entropy. 
Using the Holevo bound and the subadditivity of the von Neumann entropy 
we have 



5 



< E 



S i 



y ( j (n) 



CtG-Mn 



E s(<r£>) 



Ex* 

3=1 



\M n 



'■> P a ,j 



aeM n ' 



The latter expression can be rewritten using Donald's identity: 
E Pi s ( u j 1 1 P) = E Pj s ( u i 1 1 °) + s (° 1 1 P) > 

where a; = J2jPj^j- We apply this with p replaced by 

1 n 
n\jv\. n \ j= i a&Mn 



(97) 



(98) 



(99) 



and the sum replaced by a double sum over j and a with states a l a -. This 
yields 

in in 

E E 5(<,i|a M ) = — -e e s(<* II + II **,,■)■ 

(100) 

But, it follows from convexity of the relative entropy that the second term 
on the right-hand side is zero: 



o < sfa || <7ij) < - E^K^ II = o- 

n i=i 



101) 
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Inserting into (|97|) we now have: 



Fano's inequality (jHEJ) now yields 

— n v.- I <r 

n|.M T 



(102) 



h(pf:J)+PU log l-Mnl > log \M n \ -n Xl \{ ZTTTliP«,3 I > (103) 



However, since 

M 



^AHfenJ (104) 

and -R = 1 log |.M„,| > C($), there must be at least one branch i such that 

(n) C($) + 1/ra 

pj? > 1 - 1 } R ' > 0. (105) 

We conclude from (}94"j) and ()105j) that 

Pl^f!-^,)^,. (106) 



i=l 



□ 
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